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METHODS FOR SEPARATING AND /OR IDENTIFYING DNA MOLECULES 

The invention relates to methods for separating and 
identifying DNA molecules in mixtures of DNA molecules 
having the same number of nucleotides but different base 
sequences . 

I.*. Background to the Invention 
1 . 1 General Introduction 

Many genes exist as multiple alleles which differ from each 
other by small differences in sequence. Individuals are 
often heterozygous with respect to the alleles of 
particular genes; i.e. individuals often have two different 
alleles of the same gene. 

In some circumstances, it is desirable to separate the 
alleles of a gene from a mixture of the alleles . For 
example, when it is desired to carry out a test to 
determine which alleles of a gene are carried by a 
heterozygous individual, it is often necessary to separate 
the two alleles before carrying out the test because the 
presence of two alleles in one test can prevent meaningful 
results from being obtained. 

In view of the fact that the difference between the alleles 
of a gene can be as little as one nucleotide, it is often 
difficult to separate the alleles from a mixture of the 
alleles . These difficulties are increased in genes which 
have a large number of different alleles, such as the major 
histocompatibility complex (MHC) genes (e.g. the human 
leucocyte antigen (HLA) class I and II genes which have 
over 500 known alleles) . 

Identity of alleles is frequently essential in a clinical 
setting; for example, HLA matching between a bone marrow or 

SUBSTITUTE SHEET (RULE 26) 
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kidney recipient and donor is one of the major factors 
influencing transplant success. Up to date the most 
favourable bone marrow transplant (BMT) and kidney 
transplant results have been obtained using sibling donors 
who are genotypically HLA- identical to the recipient but 
such donors are available for only about 30% of patients* 1 " 
51 . BMT using unrelated donors can be successful, but 
these transplants have higher rates of graft failure, 
increased incidence and severity of Graft versus Host 
Disease and more frequent complications related to delayed 
or inadequate immune reconstitution (4) . 

New molecular biological methods for detection of genetic 
polymorphism currently provide an opportunity to improve 
e.g. HLA matching of unrelated donors as well as a research 
tool to investigate the relationship between disparity and 
transplant complications. These molecular typing methods 
include sequence-specific amplification, hybridisation with 
oligonucleotide probes, heteroduplex analysis, single 
strand conformation polymorphism and direct nucleotide 
sequencing. 

Each of these molecular approaches has been used for 
routine HLA class II typing {6) , but a variety of reasons 
related to the HLA class I gene structure has complicated 
their application to class I typing. The reasons for these 
limitations are the extensive polymorphism of each class I 
locus and the degree of sequence homology between the loci . 
In addition, sequence homology between class I classical 
and non-classical genes and the reported 12 pseudo genes 
can cause problems for specific locus amplif ication (7> . 

The low extent of "allele specific" sequences at 
polymorphic sites is a feature of the HLA class I genes 
that has limited. the resolution of all current DNA typing 
approaches. An "allele specific" sequence is a sequence 
that is only present in one allele and can therefore be 



WO 97/20070 



PCT/GB96/02960 



- 3 - 

used to distinguish the allele from other alleles. 

The main problem which complicates the identification of an 
allele is the presence of a mixture of alleles, as well as 
5 contamination by segments of DNA which have homology to the 
allele it is wished to identify and which are co-amplified 
in PCR. Current typing methods are sometimes unable to 
resolve the allele it is wished to identify from 
contaminating DNA fragments. Separation techniques such as 
10 single strand conformation polymorphism (SSCP) can only 
partially resolve this problem. 

Methods for allele separation 

15 1 . 2 Sequence specific primer amplification (PCR-SSP) 

This method utilises both the group-specific and, when 
present, allele-specif ic sequence sites in PCR primer 
design. The SSP design is based on the amplification 
20 refractory mutation system (ARMS) , in which a mismatch at 
the 3' residue of the primer inhibits non-specific 
amplif ication (8,9> . 

Although each SSP reaction may not individually provide 
25 sufficient specif ity to define an allele, the use of 
combinations of sequence specific primers allows the 
amplification of their common sequences to give the desired 
HLA specificity. 

30 However, despite its high accuracy, PCR-SSP is only in some 
cases more informative than serology. The reason for this 
is the low occurrence of allele specific sequence motifs in 
the exons and this limitation has stimulated a vast amount 
of research into the identification of allele specific 

35 motifs even in the intron sequences uo) . However, up to 

date this approach has not contributed considerably to the 
identification of more alleles. 
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Another limitation of this method is that it detects a 
limited number of polymorphic sequences which are utilised 
to predict the entire sequence. If an unknown allele is 
present in a particular sample this extrapolation may be 
incorrect . 



In addition, the successful use of the technique relies on 
group specific amplification and therefore prior knowledge 
of broad HLA specificity is needed. 

1 . 3 Single strand conformation polymorphism (SSCP) 

This technique is based on the electrophoretic mobility of 
single stranded nucleic acids in a non-denaturing 
polyacrylamide gel, which depends mainly on sequence- 
related conformation 111 " 131 . The technique can be employed 
for isolating single alleles which could then be used for 
further manipulation and analysis such as direct 
sequencing. The pattern of bands obtained after 
electrophoresis may be diagnostic for an allele* 14 ' 15 * . 

The major disadvantage of SSCP is the tendency of DNA 
single strand to adopt many conformational forms under the 
same electrophoretic conditions resulting in the presence 
of several bands from the product; this makes the 
identification more difficult. In addition there is a high 
degree of variation and inconsistency in the sensitivity of 
this method for detecting mutations or allelic variations 
and there is a physical limitation in the size of the DNA 
fragment which is of the order of 200-400 base pairs <16) . 

1.4 Denaturing Gradient Gel Electrophoresis (DGGE) and 
Temperature Gradient Gel Electrophoresis (TGGE) < 17 ' 18> 

The underlying principle of both techniques is the 
difference in the degress of melting between two alleles 
(double stranded DNA) which results in a reduction of 
mobility of the DNA fragments in polyacrylamide gels 
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containing a denaturing reagent (DGGE) or a temperature 
gradient (TGGE) . 

Both techniques have been used frequently for screening 
mutations in genetic systems with one or two variants. 
They are only rarely used for the separation of alleles in 
highly polymorphic systems such as HLA. 

Both techniques require specific conditions for a 
particular system under investigation and, in addition, 
where two alleles share common sequence segments with low 
melting points they may not always be differentiated. The 
simultaneous melting of both alleles will produce very 
similar retardations. 

1 . 5 Cloning of DNA 

This is the classical method of preparation of a single 
sequence, i.e. the sequence derived from a single allele. 
A variety of constructs has been used to introduce the 
required DNA fragment into a plasmid and grow sufficient 
copies for analysis. This method yields pure samples of 
the analyte, but is time consuming to perform and several 
clones are normally tested to ascertain the homogeneity of 
the product. 

Methods for the identification of alleles 

1 . 6 Heteroduplex analysis 

Fully matched DNA duplexes are more stable than those with 
base mismatches. Instability of the duplex increases with 
the number of nucleotide mismatches; these cause formation 
of loops and bends in the linear DNA fragment which produce 
an increasing "drag effect" in polyacrylamide gels which 
retard the affected migrating bands fl9 " 21) . 
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Mismatched DNA hybrids (heteroduplex) may be formed at the 
end of each PCR cycle between coamplified alleles from a 
particular locus or loci due to primer cross reaction at 
sites with similar sequences. During the annealing stage 
5 of each cycle of the PCR / a proportion of sense strands of 
each allele may anneal to anti- sense strands of different 
alleles. The banding pattern obtained in PAGE analysis can 
be useful for identifying the alleles involved in the 
reaction (22 * 24) . 

10 

Heteroduplex analysis is an approach that has been utilised 
to compare HLA genes of a particular donor and recipient. 
HLA genes are amplified, denatured (melted into single 
strands) and mixed together under conditions that promote 

15 renaturation to form double stranded molecules. If the HLA 
genes of a donor and recipient are similar but not 
identical, heteroduplexes will form consisting of one 
strand of an allele of donor origin and a second strand 
from a different allele of recipient origin (25 » 26> . The 

20 sensitivity of this method can be increased by adding DNA 
from an HLA allele that is not present in the donor or 
recipient . 

The major advantage of heteroduplex analysis is that it is 
25 relatively easy and inexpensive. Limitations of this 
approach include inability to detect certain HLA 
disparities, potential detection of irrelevant silent 
mutations and lack of specific information regarding the 
nature of the alleles involved. 

30 

Up to date this approach has been used for HLA class II 
typing with limited success. Its application to class I 
typing has not been successful . 
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1 - 7 Sequence specific oligonucleotide probes (PCR-SSO) 

SSO typing involves amplification of HLA alleles from a 
particular locus followed by hybridisation with a panel of 
5 oligonucleotide probes to detect polymorphic sequences that 
distinguish one allele or group of alleles from all others. 
In polymorphic systems a one step operation may not always 
differentiate all the known alleles; selected primers can 
be used to achieve amplification of individual alleles 
10 which are then identified by specific probes. This second 
stage of oligotyping is often referred to as high 
resolution oligotyping' 61 . 

The advantages of the PCR-SSO method are specificity, 
sensititivity, simplicity, reproducibility, and it is 
relatively inexpensive to operate and allows simultaneous 
processing of many samples. This approach has been applied 
successfully, for example, to typing of HLA class II 
alleles . 

The major methodological drawback of this approach is that 
the complexity of the technique is directly related to the 
number of alleles under investigation and the presence of 
two alleles in the heterozygous condition can complicate 
the identification process. 

Publishing oligotyping methods could result in incorrect 
interpretation of data if certain combinations of recently 
discovered alleles are present in a specimen. It is 
3 0 therefore necessary to update the reagents used in the 

identification step. Several typing approaches for HLA- A 
and B based on PCR-SSO have been published; these typically 
require over 4 0 and 90 probes respect ively (27 ' 28) . The 
operation of these methods is time consuming and the 
35 resolution obtained is only moderate. 
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1.8 Nucleotide Sequencing 



DNA templates for sequencing can be produced by a variety 
of methods, the most popular being the sequencing of cloned 
5 genomic or cDNA fragments, or the direct sequencing of DNA 
fragments produced solely by PCR (as in 1.2 above) . These 
templates represent a single sequence derived from one 
haplotype. Alleles from both haplotypes of a heterozygous 
sample may be co- amplified and sequenced together using 
10 locus-specific PCR primer. 

The recent availability of computer software, which allows 
the user to align the derived sequence against established 
sequence libraries, has facilitated the analysis and allele 

15 assignments for heterozygous samples in which both 
templates are sequenced at the same time (27) . The 
effectiveness of this method depends on the amount and 
frequency of ambiguous heterozygous combinations, for 
example there are many HLA class II alleles that when 

20 present together in one sample cannot be differentiated by 
this method. The number of such ambiguous combinations of 
allele sequences is even greater for HLA class I alleles. 

Up to date two HIiA class I typing approaches based on 
25 direct sequencing have been published. Both require 
serology information followed by allele specific PCR 
amplification and then direct sequencing U4, 30) . More recent 
practice, however, is to amplify DNA fragments without 
prior knowledge of the allele groups and to use locus 
3 0 specific PCR amplification. Theoretically these approaches 
should give the highest resolution, but they are beset by 
ambiguous sequence combinations which cannot be resolved 
satisfactorily and in practice these methods are expensive 
and difficult to perform routinely. 
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2 . Summary of the Invention 



The invention provides methods for separating and/or 
identifying a DNA molecule in a mixture of DNA molecules. 
5 The separation methods comprise 

(i) amplifying the DNA molecules in the mixture; 



(ii) hybridising single strands of the amplified DNA 
10 molecules with a complementary strand of a 

reference DNA molecule so as to form duplexes; 
and 

(iii) separating the duplexes. 

It is possible to isolate the single strands of the 
amplified DNA molecules between steps (i) and (ii) , and 
hybridise these isolated single strands with an isolated 
complementary strand of a reference DNA molecule in step 
(ii) . 



15 



20 



Alternatively, the duplexes of amplified DNA formed in step 
(i) can be used in unisolated form in step (ii) . In this 
case, the amplified duplexes may be contacted directly with 

2 5 a reference DNA molecule containing a labelled strand in 

step (ii) . Only the single strands of the amplified 
duplexes which hybridise to the labelled strand of the 
reference DNA molecule are detected in any subsequent 
detection step. 

30 

The different DNA molecules in the original mixture give 
rise to duplexes having different numbers, positions and/or 
types of mismatches . This allows the duplexes to be 
separated by, for example, gel electrophoresis. The 

3 5 separated duplexes can then be analysed to identify the DNA 

molecules that were present in the original mixture. Two 
embodiments of the method of the invention are illustrated 
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The method of the invention can be used directly as a 
diagnostic technique to identify a DNA molecule by the use 
5 of a specific reference DNA molecule. The formation of a 
homoduplex identifies a DNA molecule in the unknown mixture 
as identical to the reference DNA molecule. The formation 
of a heteroduplex may also be used to identify an unknown 
DNA molecule by using a known heteroduplex as a control. 

10 The invention includes a method for identifying a DNA 
molecule, which method comprises (i) contacting the DNA 
molecule with a labelled reference DNA strand under 
conditions such that the reference stand hybridizes to a 
complementary strand of the DNA molecule so as to form a 

15 test duplex; (ii) running the test duplex and one or more 
control duplex (es) in a gel by electrophoresis; and (iii) 
comparing the position of the test duplex on the gel with 
the position (s) of the control duplex (es) . 

20 The method of the invention can also be used as a 

separation technique for separating the alleles in a 
mixture of unknown alleles of a polyallelic gene # such as 
the mammalian MHC genes (e.g. the HLA genes) . Duplexes 
formed between the unknown alleles and a reference allele 

25 are separated so as to isolate the unknown alleles for 

identification by techniques such as DNA sequencing, SSP 
and SSO. 

The use of a single reference DNA molecule is effective for 
3 0 separating and/or identifying many DNA molecules, such as 
alleles from genetic loci in which there is only a small 
number of different alleles existing in the population. 
However, we have found that the use of two reference DNA 
molecules is unexpectedly advantageous, especially where 
3 5 there is a great number (e.g. from 10 to 3 00 or from 3 0 to 
100) of alleles in the population, such as in the case of 
the HLA genes. For some genes, there may be two or more 
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different alleles that give duplexes that migrate to the 
same position with a single reference strand. We have now 
found that such alleles may be distinguished from each 
other by using a second reference strand that forms 
5 duplexes that migrate to different positions. The combined 
results for two reference strands give a unique parameter 
for each allele. In some circumstances a third reference 
strand may be desirable, but two reference strands will 
resolve most ambiguities of allelic type, even at genetic 
10 loci such as HLA in which there is a great variety of 
allelic type. 

The effectiveness of using two reference strands is 
illustrated by, for example, Figures 7 and 9. These 
15 Figures show that the two HLA class I alleles A*3 001 and 

A*0201 migrate to similar positions when A*0101 is used as 
the reference strand, and it would therefore be difficult 
to distinguish between these two alleles using the A*0101 
reference strand alone. However, when A*0217 is used as 

2 0 the reference strand, the two alleles are separated by a 

great distance. Thus, the combination of reference strands 
A*0101 and A*0217 allows unambiguous identification of the 
alleles, whereas the use of only one reference strand would 
not. Since the HliA loci are among the most complex known, 
25 it is clear that the invention is broadly applicable to a 
wide variety of genetic loci, even those that are complex 
in their genetic variation. 

In one embodiment, the invention provides a method which 

3 0 comprises 



(i) amplifying the DNA molecules in the mixture 

employing a pair of primers in which one of the 
primers carries a ligand, so as to produce an 
3 5 amplified mixture of double -stranded DNA 

molecules in which one of the strands carries a 
ligand; 
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(ii) contacting the amplified mixture of double - 
stranded DNA molecules with a receptor on a solid 
support under conditions such that the ligand 
binds to the receptor; 

5 

(iii) separating the mixture of double- stranded DNA 
molecules into single -strands and removing the 
strands that are not bound to the support by 
the ligand; 

10 

(iv) recovering the remaining strands from the 
support ; 



(v) mixing the recovered strands with a complementary 
15 strand of a reference DNA molecule so as to form 

dup 1 exe s ; and 



(vi) separating the duplexes. 



20 This embodiment and all subsequent applications of the 

embodiment may be modified by recovering the single -strands 
that do not bind to the support in step (iv) , and using 
these strands instead of the strands which bind to the 
support to form duplexes in step (v) . 

25 

The complementary strand of the reference DNA molecule may 
be provided by essentially the same technique as the 
technique set out above in steps (i) to (iv) for providing 
the mixture of DNA molecules in single- standard form. In 
30 particular, the reference complementary strand DNA molecule 
may be provided by 

(i) amplifying the reference DNA molecule employing 
a pair of primers in which one of the primers 
35 carries a ligand, so as to produce amplified 

double- stranded reference DNA molecule in which 
one of the strands carries a ligand; 
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(ii) contacting the double -stranded reference DNA 
molecule with a receptor on a solid support under 
conditions such that the ligand binds to the 
receptor; 

(iii) separating the double -stranded reference DNA 
molecule into single-strands and removing the 
strand that is not bound to the support by the 
ligand; and 

(iv) recovering the remaining strand from the support. 



In another embodiment of the invention, there is provided a 
15 method which comprises 

(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a high molecular weight molecule, 

20 so as to produce an amplified mixture of double - 

stranded DNA molecules in which one of the 
strands carries a high molecular weight molecule; 

(ii) separating the mixture of double- stranded DNA 
25 molecules into single strands; 

(iii) mixing the single strands with a complementary 
strand of a reference DNA molecule so as to form 
duplexes; and 



30 



(iv) separating the duplexes 



The complementary strand of the reference DNA molecule may 
3 5 be provided by 



PCT/GB96/02960 

- 14 - 

amplifying the reference DNA molecule employing a 
pair of primers in which one of the primers 
carries a high molecular weight molecule, so as 
to produce an amplified double- stranded reference 
DNA molecule in which one of the strands carries 
a high molecular weight molecule; and 

(ii) separating the double -stranded reference DNA 
molecule into single strands . 

This embodiment overcomes the need for solid support 
systems by conjugating one primer of a pair of primers 
directly to a high molecular weight molecule (e.g. a 
protein), for the reference and test systems. The 
amplified product after hybridisation can be applied 
directly to a separating gel. The high molecular weight 
conjugates are retained in the gel compared to the duplex 
without attachment of the high molecular weight molecule. 

In a further embodiment of the invention, there is provided 
a method which comprises 

(i) amplifying a single strand of each of the DNA 
molecules in the mixture; 

(ii) mixing the amplified single strands with a 
complementary strand of a reference DNA molecule 
so as to form duplexes; and 

(iii) separating the duplexes. 

In this embodiment, the complementary strand of the 
reference DNA molecule may be provided by amplifying a 
single strand of the reference DNA molecule. The 
amplification of the single strand of the reference or test 
DNA molecule can be done, for example, by asymmetric PCR. 
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10 



15 



This embodiment overcomes the need for both solid support 
systems and conjugation of one primer of a pair to a high 
molecular weight molecule. However, in the embodiment it 
is possible to use a primer carrying a ligand such as a 
hapten in order to facilitate capture of the amplified 
strand with a receptor such as an antibody and separation 
of the amplified strand from other components in the 
amplification mixture. 

After separation of the DNA molecules by one of the above 
methods, the molecules present in the mixture may be 
identified by carrying out one or more of the following 
steps : 

(i) comparing the positions of the separated duplexes 
on the gel with the position of a control DNA; 

(ii) sequencing each of the separated molecules; 

(iii) sequence specific primer (SSP) amplification 
analysi s ; and 

(iv) sequence specific oligonucleotide (SSO) analysis. 

The invention provides an improvement over prior methods 
for separating DNA molecules. The advantages offered by 
the invention can be summarised as follows: 

(a) The invention provides a high resolution between 
different DNA molecules and differences of as 
little as one nucleotide between molecules can be 
detected. 

(b) The invention allows simultaneous and rapid 



processing of a large number of samples. 
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(c) The invention is comparatively inexpensive to 
perform, particularly when compared to prior 
methods which achieve a high level of resolution. 

(d) The invention uses techniques that can be 
performed easily without recourse to complex and 
expensive technology . 

3^ Principle underlying the invention 

Fully matched DNA duplexes are more stable than those with 
base mismatches. Regions of nucleotide sequence which are 
complementary retain the double stranded structure, but 
mismatched regions form single -stranded loops which induce 
kinks along the length of the DNA molecule. Relative to a 
reference strand, the number, size, composition and 
position of the single-stranded loops vary for each allele. 
Heteroduplex DNA migrates more slowly than the 
corresponding homoduplex DNA. Both denaturing reagents 
and/or heat enhance the effect of the mismatches. Since 
the rate at which the DNA migrates in polyacrylamide or 
special agarose gels depends on both molecular conformation 
and molecular weight, the described method of introducing 
conformational changes in the hybrid duplex provides the 
basis of highly specific separation of alleles. 

As the molecular conformation of heteroduplexes can be 
manipulated by hybridisation of a known single strand 
reference with unknown complementary single strand (s) , it 
is proposed that heteroduplexes can be separated from each 
other by e.g. denaturing or non- denaturing polyacrylamide 
electrophoretic analysis. This allows the separation of 
the two amplified alleles from a particular locus for 
further analysis. In addition, the method of the invention 
(which we call "Complementary Strands Analysis" (CSA) ) , 
permits assessment of the quality of the PCR product before 
the process of identification is carried out. CSA is able 
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to identify the presence of coamplif ied non-desirable 
alleles from different loci and, potentially, PCR fragments 
that contain artifacts such as Taq errors and in vitro 
recombinations . 

5 

In addition, CSA itself can be used as a diagnostic 
technique. It can identify alleles by hybridisation of 
allele specific single strand with unknown complementary 
single strand (s) followed by e.g polyacrylamide gel 

10 electrophoretic analysis, with or without denaturing 

conditions and/or with or without a temperature gradient. 
The formation of a homoduplex demonstrates identity between 
at least one of the unknown alleles and the allele specific 
reference, and non-identity between them produces 

15 heteroduplex(es) . 

i-i. Detailed Description of the Invention 

The kinds of DNA molecule that may be separated and 

2 0 identified by the methods of the invention include alleles 

of polyallelic genes, segments of genes and non-expressed 
fragments . 

Examples of genes with multiple alleles to which the 
25 invention may be applied are the mammalian MHC genes such 
as the HLA class I and class II genes, the T cell receptor 
genes in mammals < 33 - 34 >, TAP, LMP, ras <32) , non classical HLA 
class I genes, the genes for human complement factors C4 
and C2, Bf in the human HLA complex, and genes located in 

3 0 mitochondrial DNA, bacterial chromosomes and viral DNA. 

The invention can be used in the analysis and 
identification of mutations (e.g. point mutations) in these 
and other genes and chromosomal aberrations such as 
translocation, deletions and inversions. 



35 
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There are three different genes within the HLA class I 
group of genes, namely HLA-A, HLA-B and HLA-C, and each of 
these three genes exists in the form of multiple alleles. 
There are a total of about 222 known alleles of the HLA- A, 
HLA-B and HLA-C genes and the sequences of known alleles 
are set out in Arnett and Parham (1995) Tissue Antigens 45. 
217-257. There are also multiple genes within the HLA 
class II group of genes, known as DR, DQ and DP. 

In the method of the invention, it is necessary to identify 
primer sequences unique for the target gene so as to 
include all polymorphic sites of interest in the amplified 
fragment, which should also be manageable in length. For 
example, the polymorphic sites in exons 2 and 3 of HLA 
class I would facilitate the identification of all 
recognised alleles of HLA-A, B and C, with 5 exceptions. 
Therefore, the primers used in the invention may, for 
example, be selected so as to specifically amplify exons 2 
and 3 of each of HLA-A, HLA-B and HLA-C separately. 
Cereb <31) and collaborators have described primer sequences 
located in the first and third exons which can be used for 
locus- specif ic amplification of the entire exon 2 and 3 
region of each of the HLA-A, HLA-B and HLA-C genes. The 
sequences of suitable primers are given in Example 1 below. 

The reference DNA molecule used in the invention may have a 
known sequence. The reference may be chosen so as to have 
a similar allotype to an allotype that at least one of the 
test alleles is suspected of having. For example, it may 
be known that a test allele is of the HLA- AO 2 type from 
serological data, but it may not be known which of the 
seventeen A02 sub- types the allele is. In this case, the 
reference allele may be chosen to be of sub- type A02 01 and 
the method of the present invention could then be used to 
determine which of the A02 sub- types the test allele is. 
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The reference strand may be obtained from (a) a homozygous 
source, (b) a heterozygous source from which individual 
strands are isolated by gel separation after amplification 
steps or (c) DNA synthesis. There are now about 500 
5 internationally recognised cell lines which contain HLA 
alleles of known sub-type and these cell lines can be used 
as a source of reference alleles . 

The control DNA used in the method of the invention may be 
0 a homoduplex between two strands of the same DNA molecule 
(e.g. the reference DNA molecule), so that migration of a 
test duplex to the same position on the electrophoretic gel 
as the control homoduplex indicates that the test duplex is 
a homoduplex. If the test duplex is a homoduplex, it can 
5 be concluded that the unknown DNA molecule is the same as 
the reference molecule. 

Control DNAs may be obtained by simply amplifying a known 
DNA molecule using the same primers as used in the method 
) of the invention to amplify the reference and unknown 
molecules . 

The control DNA may also be a heteroduplex of known DNA 
molecules. This allows the method of the invention to be 
used to identify molecules in heteroduplexes formed by test 
samples. The same heteroduplexes from different sources 
migrate to the same position on a gel. 

A potential problem with identifying molecules in 
heteroduplexes is that certain different heteroduplexes may 
contain combinations of mismatches which cause them to 
migrate to the same position. A method for overcoming this 
problem would be to use a second reference strand which 
provides different combinations of mismatches. 
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In typing of the HLA class I alleles, the different 
duplexes could be identified by their different sizes (the 
HLA- A, HLA-B and HLA-C genes are different sizes) or by 
amplifying each of the HLA- A, HLA-B and HLA-C genes with 
5 primers carrying different labels. Each locus specific 
heteroduplex would have a different size or carry a 
different label, and could be electrophoresed 
simultaneously in the same track of a gel. The duplexes 
could then be identified by comparing them to control 
10 duplexes in the same gel. Examples of suitable labels 

include radiolabels, colour labels and fluorescent labels. 

The mixture of alleles used in the method of the invention 
may be from a prospective donor or a prospective recipient 
15 in a tissue or organ transplant operation. The results of 
the method may therefore be used to match a prospective 
recipient with a prospective donor. 



In one embodiment of the invention, the alleles of the 
prospective donor or of the prospective recipient are in 
effect used as reference alleles and duplexes are formed 
between strands of the prospective recipient's alleles and 
of the prospective donor's alleles. Analysis of the 
duplexes formed between the strands from the prospective 
recipient and donor reveals whether they have the same 
alleles. Thus, in one embodiment, the invention provides a 
method for determining whether a prospective recipient in a 
tissue or organ transplant operation has alleles of a gene 
that are compatible with the alleles of a prospective donor 
in the operation, which method comprises 

(i) amplifying the alleles of the prospective 

recipient employing a pair of primers in which 
one of the primers carries a ligand, so as to 
produce amplified double -stranded alleles of the 
prospective recipient in which one of the strands 
carries a ligand; 
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(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under 
conditions such that the ligand binds to the 
receptor; 

5 

(iii) separating the double -stranded alleles into 
single -strands and removing the strands that are 
not bound to the support by the ligand; 

10 (iv) recovering the remaining strands from the 

support ; 

(v) mixing the recovered strands with complementary 
strands of the alleles of the prospective donor 

15 so as to form test duplexes; 

(vi) separating the test duplexes by gel 
electrophoresis; and carrying out one or more of 
the following steps: 

20 

(vii) comparing the positions to which the test 
duplexes migrate on the gel with the position of 
a control DNA; 

25 (viii) sequencing one or both strands of each of the 

test duplexes; 

(ix) sequence specific primer (SSP) amplification 
analys i s ; and 



30 



(x) sequence specific oligonucleotide (SSO) analysis 



Other proposed uses of the invention include determination 
of the paternity of an individual by identifying one (or 
35 more) of his alleles to see if it is the same as a 

corresponding allele of a putative father. The invention 
may also be used in forensic medicine to determine the 
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origin of a sample of body tissue or fluid, as a follow up 
technique in treatment of haematological malignancies or 
inherited disorders, in adoptive immunotherapy, and in 
identification of bacteria and viruses. 

5 

In the method of the invention, the amplification steps may 
be carried out by polymerase chain reaction (PCR) . 

The ligand/receptor system used in the invention may, for 
10 example, be the biotin/streptavidin system or a 

hapten/antibody system. Direct conjugation of the primer 
via a linking group, such as short poly A, to the beads is 
an alternative. When the biotin/streptavidin system is 
used, one of the primers used in each of the amplification 
15 steps may be labelled with biotin, so that when the 

amplification reaction is carried out double -stranded DNA 
is produced in which one strand carries a biotin label. 
The double -stranded DNA may then be bound to a solid 
support coated with streptavidin. 

20 

The solid support used in the invention is typically 
magnetic beads. However, other supports may be used, such 
as the matrix of an affinity chromatography column. When 
the support is in the form of magnetic beads, the two 

25 strands of the amplified DNA are separated by attracting 
the beads to a magnet and washing the beads under 
conditions such that the double -stranded DNA dissociates 
into single - strands . The dissociation is typically 
performed by incubating the beads (e.g three times) under 

3 0 alkaline conditions (e.g. 0.1 M or 0.15 M NaOH) at room 

temperature for about 5 or 10 minutes. Usually, the strand 
which is not bound to the support by the ligand is then 
discarded, although it is equally possible to retain the 
strand that is not bound to the support and discard the 

35 strand that is bound to the support. 
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The strand that remains attached to the support may be 
recovered from the support by incubating the support under 
conditions such that the ligand/receptor complex 
dissociates. When the biotin/streptavidin system is used, 
the support is typically heated to e.g. 95 °C for about 5 
minutes; this ensures denaturation of the streptavidin 
molecule to release the biotinylated single strand which is 
then recovered. 



0 At this stage, there have been provided a single-stranded 
unknown allele and the complementary strand of a reference 
allele. The two strands are then mixed together under 
conditions in which they hybridise to form duplexes. 
Typically, the hybridisation step is performed by heating 

5 the mixture of strands at about 95 °C for about 3 min, at 
about 70 °C for about 5 min and then at about 65 °C for about 
45 min. 



Under these conditions, duplexes are formed which can 
subsequently be separated by gel electrophoresis (e.g. 
polyacrylamide gel electrophoresis) . The electrophoresis 
may be carried out under denaturing conditions because this 
may enhance separation. 

As an alternative separation technique to gel 
electrophoresis, high pressure liquid chromatography (HPLC) 
may be used. 

In the embodiment of the invention in which one of the pair 
or primers is conjugated to a high molecular weight 
molecule, the molecule may be a protein such as bovine 
serum albumin (BSA) . The molecular weight of the high 
molecular weight molecule is such that it causes the DNA 
molecule to which it is attached to be sufficiently 
retarded in the separation step {e.g. the electrophoresis 
step) to allow the DNA molecule to be separated from a 
duplex without a high molecular weight compound attached. 
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For example, the molecular weight of the high molecular 
weight molecule may be from 10 to 200 kDa, preferably 20 to 
100 kDa. 

5 In a preferred embodiment of the invention, DNA fragments 
from a particular segment of a genome can be separated and 
identified with the use of a labelled reference strand 
without the isolation of the duplex from the separation 
phase. This approach can be automated with the use of 
10 currently available technology for simultaneous resolution 
and identification of alleles. 

In this diagnostic use of the method, separation of the 
sample DNA fragments into a single strand or the use of 
15 ligand bearing primers is not necessary. The pre-labelled 
reference strand is mixed with the sample and, following 
the hybridisation step, the mixture is analysed by a method 
which will separate the DNA duplexes (eg electrophoresis or 
high pressure liquid chromatography) . This embodiment is 

2 0 referred to as "Double Strand Conformation Analysis" 

(DSCA) . 

The selective identification of the marker bearing 
reference -sample duplexes excludes those hybrids which have 
25 not complexed with the reference strand. The position of 
the identified bands is diagnostic. These can be 
accurately assigned by the inclusion of reference mobility 
markers, which could be (a) internal markers with faster 
and slower mobilities than the duplex under investigation 

3 0 or (b) multiple markers that, for electrophoretic runs, 

would be resolved simultaneously in a separate track and 
would form a reference ladder of graded mobilities. 

In this embodiment, the DNA sample fragments can be 
35 obtained by (a) specific amplification, e.g. PCR, or (b) 

enzymatic digestion of the genome where the number of genes 
is limited (e.g. viral genomes, microbial plasmids, vector 
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cassettes or the segments from (a) ) . 

The reference strand, which is partially complementary in 
sequence to the intended sample target, may be prepared by 
i introducing specific nucleotides at selected positions 
which enhance the separation of the duplexes during the 
separation stage. The reference strand may be synthetic, 
i.e. man made and not from any naturally-occurring allele. 
An optimum reference can be designed which produces an 
optimised distribution of bands. A second and further 
references can be used which improve the resolution of the 
bands which are not well resolved by the use of a single 
reference. This improves the accuracy of the 
identification, in particular for closely related alleles. 

The reference may be synthesised: 

a. By a combination of specific primers which generate a 
strand with three nucleotide differences at 3' end; this 
would ensure the specificity of the amplification following 
the separation of the sample strands. 

b. By preparing short fragments of target sequence either 
by specific amplification or where necessary synthetically. 
After specific alterations to the sequences these fragments 
would be ligated to produce a single reference strand. 

c. The reference strand may have a fluorescent ligand 
attached at one end for identification or it could carry a 
ligand or compound (e.g. biotin) that would allow 
attachment of an enzyme molecule (e.g. via streptavidin) . 
The reference-sample duplex can then be identified either 
by fluorescence detection methods or by enzyme amplified 
methods using e.g. a colourometric or chemi luminescent 
technique. 
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This approach is suitable for automated separation and 
detection. In order to automate the analysis, it is 
proposed to include mobility markers in the separation 
phase. These labelled marks would be either defined 
5 segments of genomic DNA prepared by amplification or 

synthetically prepared so as to act as reference points for 
automatic computation of the exact position of the sample 
under investigation. 

10 Use of the inv ention to analyse ras oncogene point 

mutations:- ras has been implicated in the oncogenesis of 
many tumours and appears to be activated by point 
mutations. These mutations can occur in all three ras 
genes (N-ras, Harvey-ras and Kirsten-ras) at codons 12/13 

15 and 61 with corresponding amino acid substitutions in ras 
proteins (p21) . These point mutations can be detected by 
application of the invention. 

Two pairs of primers are needed, one for the 12/13 codons 
20 and one for codon 61. The primers described by Lyons (32) 
can be used, with modification by covalent attachment of a 
ligand to one primer of each pair for each of the test 
fragment and the reference fragment. For example, a ligand 
may be attached to primer Pla (a 12/13 codon primer) and 
25 primer Plb (a 61 codon primer) for the reference fragment, 
and to primer P2a (a 12/13 codon primer) and primer P2b (a 
61 codon primer) for the test fragment. In this way, 
complementary ligand- labelled single strands for the 
reference and test fragments are obtained. The 
3 0 complementary strands are hybridised and subjected to 
electrophoresis. Detection of a homoduplex between the 
test fragment and a mutant reference fragment will indicate 
the presence of the mutation in the test fragment . 
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Use of the inventi on to identify T cell receptor (TCR) 
rearrangements in T cell tumours and in adoptive 
immunotherapy;- some T cell tumours can be monoclonal in 
origin and a proportion of the T cells from a patient may 
5 carry a particular rearrangement of the T cell variable 
domain genes alpha/beta or gamma/delta depending on the T 
cell type. The efficacy of a particular treatment or the 
course of the disease can be evaluated by the 
identification of the malignant clone TCR rearrangement. 
10 The method of the invention with the use of suitable number 
of controls can be made semi -quant at ive, which would allow 
the evaluation of the progress of the treatment or the 
disease . 



15 In adoptive immunotherapy, a specific rearrangement of the 
variable domain genes of the TCR can be used as a marker 
for the selected cytotoxic T cell that has been generated 
in- vitro. Post infusion fate of these cells can be 
monitored by a semi -quant at ive detection of the particular 

2 0 rearrangement . 



In both methods the T cell variable domain gene primers 
(33 ' 34) can be modified by covalent attachment of a ligand at 
the 5' end of the primer pair, and the reference strand 

2 5 will be selected to be complementary to the test DNA 

fragments . 

The following Examples illustrate the invention. 

3 0 Brief Description of the Drawings 

Figure 1 shows a schematic overview of an embodiment of 
the invention. 

3 5 Figure 2 shows a schematic overview of another embodiment 
the invention. The difference in alleles A*0201 and A*0206 
is one nucleotide at position 102 on exon 2. The reference 
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strand has adenine at that position and there are other 
differences the reference and the sample sequences. Sense 
or antisense DNA strands may be used providing that the 
base sequence of reference DNA is complementary to the 
5 unknown sample DNA. 

Figure 3 shows the band pattern on gel electrophoresis 
between the sense reference strand of the allele HLA-A*0101 
and anti-sense strands from several locus A alleles. Lane 
10 1: STEINLIN (A*0101) , Lane 2: KIME (A*0211 -A*3201) , Lane 
3: DAUDI (A*0102 - A*6601) , Lane 4: EA (A*0301) , Lane 5: 
LCL721 (A*0101 - A*0201), Lane 6: M7 (A*0202-A0301) , Lane 
7: CJO-A (A*1101) , Lane 8: T5-1 (A0101 - A*0201) , Lane 9: 
L0541265 (A*0101) , Lane 10 AM <A*0205 - A3201) . 

15 

Following HLA A locus specific amplification of exons 2 & 
3, the antisense DNA strands were isolated as described in 
Example 1 below and hybridised with complementary locus 
reference strand. Samples were applied to PAG/agarose 

20 cassette and electrophoresis was performed at 23 0 volts for 
6 hrs at room temperature; the gel was stained with SYBR 
Green I. Single bands were obtained from homozygous cell 
lines and double bands from heterozygous lines. Duplexes 
from identical alleles have the same mobility in the gel. 

25 Samples in lanes 5 and 8 are from alleles A0101 and A0201, 
and samples in lanes 1 and 9 are from homozygous line with 
A0101 which has the same mobility as the fast lines in 
lanes 5 and 8 . 

3 0 Figure 4 shows analysis of HLA class I locus DNA fragments 
from three cell lines separated by CSA in non- denaturing 
PAGE with the use of reference strands from each locus. 

PCR products of locus specific amplification were 
35 processed to isolate anti-sense single strands, and these 
were hybridised with complementary reference strands 
(A*0101, B*4402 and Cw*0701) as appropriate. 
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M=marker DNA fragments; HS67 alleles, A=1101 and 2402, 
B=5801 and 6701, Cw=0701/unknown; M7 alleles, A=0202 and 
0301, B=5301 and 35(serlogy), Cw=4 (serology) homozygous; AM 
alleles, A=0205 and 3201, B=4901 and 5301? (Standard for 
identification not available), Cw=0701 and 10 (serology) . 
For serological types allelic standards were not available 
for identification. The homozygous Cw sample M7 yields a 
single band (lane 7) . 



10 Figure 5 shows the results of an experiment in which CSA 

was used to cross-match HLA-B alleles in an attempt to 

identify a potential bone marrow donor from a family of 
seven. 



15 Parents and siblings were typed by serology only and CSA 

was used to identify a match. DNA was amplified with locus 
specific primers and the PCR products were processed to 
isolate anti-sense single strands. These were hybridised 
with complementary reference strand B*4402. The duplexes 

20 were analysed in non-denaturing PAGE. Direction of 
migration is marked by arrow. 

The complete allelic profile of the family is not 
available. The patient (Lane 4) has inherited maternal 
25 allele B7 and paternal allele B44 at this locus. Father 
has B44 (slow) allele and an unidentified second allele; 
mother has B51 (slow) and B7 (fast) alleles. One sibling 
(lane 5) has identical B alleles with the patient. 

30 Figure 6 illustrates the use of CSA to identify mutations 
in exon 13 of the cystic fibrosis gene. 

DNA from a healthy individual and two cystic fibrosis 
patients were used to amplify exon 13 of the cystic 
35 fibrosis gene by PCR; single strand DNA was isolated and 
hybridised with complementary reference single strand wild 
type; the duplexes were analysed in non-denaturing PAGE. 
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Vertical arrow indicates direction of migration of bands; 
horizontal arrows point to the position of the bands. 

The wild type in Lane 1 yields a single homoduplex (fast 
5 band) . The patients have mutation in exon 13 and are 
heterozygous for this exon. The slow band in Lane 2 
represents heteroduplex due to deletion of a single 
nucleotide (T) at position 2603 in the exon. In lane 3 the 
slow band is due to heteroduplex formation with the mutated 
10 allele R709X which differs from the wild type by a single 
substitution. 

Fiqure 7 shows fluorescent peaks observed in one embodiment 
of double strand conformation analysis (DSCA) and 
15 illustrates the use of two reference strands. 

HLA-A 0201 and 3001 alleles from two samples were amplified 
and hybridised with two reference strands. The results in 
the upper box show duplex migration with reference A*0101 

20 and the lower box with reference A*0217. The difference in 
the separation of the peaks is due to the number of 
nucleotide mismatches which affect the migration rates of 
the respective duplexes. This approach allows accurate 
identification of alleles which may have closely migrating 

25 bands that result in overlapping peaks with only one 
reference strand. 



Fiqure 8 shows the sensitivity of the DSCA method in the 
analysis of HLA-B and Cw alleles. 

i0 

The upper box shows the analysis of HLA-B locus for cell 
line DAUDI. The fluorescent peaks detected distinguish 
between the two alleles B*5801 and B*5802 which differ by 
three nucleotides. The cell line WT49 is homozygous for 
5 B*5801 allele. The position of the peaks for B*5801 is 

identical irrespective of its source. The reference strand 
in this experiment was B*4402. 
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The lower box shows the analysis of the HLA-Cw locus from 
cell line CLA. This cell line is heterozygous for this 
locus and the alleles Cw*0701 and Cw*0702 differ by two 
nucleotides. The reference strand in this experiment was 
5 Cw*0303. 

Figures 9 and 10 show the analysis of HLA-A alleles by 
DSCA using two reference strands. DNA was obtained from B- 
lymphoblastoid cell lines and amplified as described in 
10 Example 3. Hybridised duplexes were analysed by non- 
denaturing PAGE in an automated DNA sequencing instrument . 
Vertical axis indicates RV figure. 

Figure 11 shows a graphic analysis of the combined results 
15 from Figures 9 and 10. The plot of RV from the two 

reference strands allows unambiguous identification of the 
alleles typed by DSCA. 

Figures 12 and 13 show the analysis of HLA-B alleles with 
20 the use of two reference strands. DNA was obtained from B- 
lymphoblastoid cell lines and amplified as described in 
Example 3 . Hybridised duplexes were analysed by non- 
denaturing PAGE in an automated DNA sequencing instrument . 
Vertical axis indicates RV figure. 

25 

Figure 14 shows a graphic analysis of the combined results 
from Figures 12 and 13 . The RV plot from two reference 
strands permits the unambiguous identification of the 
alleles typed by DSCA. 

30 

Figure 15 illustrates the analysis by DSCA of four 
different samples for each of two alleles. In all cases 
the mobilities of the bands for each allele were identical. 
Box A shows the analysis of HLA*03 01; the peak at RV 1000 
35 is the homoduplex in this test run and additional peaks 
indicate heterozygosity of the relevant sample. Box B 
shows the analysis of HLA-B*4402; in this test run all four 
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samples were heterozygous . 

EXAMPLE 1: SEPARATION OF HLA ALLELIC STRANDS BY CSA 
5 1. Methods 



1 . 1 Summary 

Amplification and isolation of the biotinylated anti-sense 
10 strand (s) were performed. They were then hybridised with 
allele specific reference sense single strands. 

In order to test the resolving power of this method, 7 
allele specific reference sense single strands were 
15 prepared. These were hybridised with several isolated 
anti-sense strand HIiA-A alleles which were selected to 
include alleles with one or several nucleotide differences 
compared to the reference strand. 

20 Following this step, PAGE analysis was performed under non- 
denaturing conditions (urea at a range of 2-6 M and 
formamide between 10%- 3 0% concentrations were included in 
other experiments) and the gels were run at either room 
temperature or 50-58°C (200 Volts for 6 hours) . 

25 

The identification of a single band in PAGE gels indicated 
identity between the allele and the reference strand and 
homozygosity of the sample. 

3 0 As a positive control, double strand DNA PCR product from 
the allele reference was always used. 

1 * 2 Locus specific amplification of HLA class I genes 



35 



For typing purposes amplification of exons 2 and 3 is 
desirable, and the primers were therefore selected to 
amplify the stretch of the genome between intron 1 and 
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intron 3. The localisati on and nucleotide sequences of the 
HLA locus -specific primers used are given in the reagents 
section . 

5 PCR reactions were performed in a total volume of 10 0//1 
using 1/xg of genomic DNA and 25 pmoles of each locus- 
specific primer. The 3 '-primer was biotinylated at 5' -end. 
This arrangement ensures the incorporation of the 
biotinylated primer onto the amplified antisense DNA 
10 strand. PCR conditions are given in the following table. 

Thermocvclina conditions 



15 A, B and C loci 

95°C 



4 min. 



1 cycle 



20 



95°C 
65°C 
72°C 



30 sec. 
50 sec. 
3 0 sec. 



33 cycles 



72°C 



8 min , 



1 cycle 



25 



30 



1 . 3 Separation of the amplified DNA strands 
1.3a Removal of non-biotinvlated strand: 

Magnetic beads with covalently coupled streptavidin on the 
surface were added to the PCR product -and incubated for 30 
minutes at 43 °C. In this way the amplified PCR product was 
immobilised by the interaction of biotin and streptavidin. 
After incubation, the tubes were placed against a magnet 
and the beads were washed with washing buffer to remove the 
remaining PCR reaction components. 



3 5 The non-biotinylated DNA strand was then dissociated from 
the beads by incubation with 0.15 M NaOH at room 
temperature (r.t.) for 10 minutes. Following this the 
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beads were washed to remove excess NaOH and resuspended in 
50 fil of hybridisation buffer. 

1.3b Remo val of biotinvlated DNA strand; 

5 

The bead suspension was heated at 95°C for 5 minutes; this 
ensures denaturation of the streptavidin molecule to 
release the biotinylated amplified anti-sense single strand 
which was then removed and placed in a clean tube. 
10 At this stage, the isolates contained single biotinylated 
DNA strands from each allele. 



i^i Preparation of allele specific reference single- 
stranded DNA 

15 

DNA was extracted from 10th IHW cell lines. The following 
homozygous cell lines were selected: 

HLA-A STEINLIN (10th IHW) A*0101 

20 HLA-B SP0010 (10th IHW) B*4402 

HLiA-Cw STEINLIN (10th IHW) Cw*0701 

The PCR conditions for amplification were as above, with 
the exception that in each case the locus-specific 5'- 
25 primer was biotinylated (5' -end). The PCR products were 
analysed by PAGE to assess the fidelity of the 
amplification and in all cases a^single band was obtained. 

1 . 5 Hybridisation 

30 

The biotinylated anti- sense strand (s) from above were mixed 
with the sense strands, and the mixture was heated at 95 °C 
for 3 min., incubated at 70°C for 5 min., and then at 65°C 
for 45 min. Under these conditions, the sense and anti- 
35 sense strands were optimally hybridised and produced good 
yields. The heteroduplexes formed could subsequently be 
separated from each other by electrophoresis in 
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polyacryl amide gel. 
2 . Reagents : 

5 A) Nucleotide sequences of primers used for locus -specific 
amplification: 

5' A locus primer: GAA ACG/C GCC TCT GT/CG GGG AGA AGC AA 
(Intron 1: 21-46) 

10 

3' A locus primer: TGT TGG TCC CAA TTG TCT CCC CTC 
(Intron 3: 66-89) 

5' B locus primer: GGG AGG AGC GAG GGG ACC G/CCA G 
15 (Intron 1: 36-57) 

3 7 B locus primer: GGA GGC CAT CCC CGG CGA CCT AT 
(Intron 3: 37-59) 

20 5' C locus primer: AGC GAG GG/TG CCC GCC CGG CGA 
(Intron 1: 42- 61) 

3' C locus primer: GGA GAT GGG GAA GGC TCC CCA CT 
(Intron 3 : 12-35) 

25 

B) Buffers: 



Washing buffer: 



30 



10 mM Tris-HCl pH 7.5 

1.0 mM EDTA 

2.0 M NaCl (or 6 M LiCl) 



35 



Hybridisation buffer: 



PCR buffer: 



20 mM Tris-HCl pH 8 . 4 

50 mM KC1 

20 mM Tris-HCl pH 8.4 

50 mM KC1 
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0.2 ntM 



MgCl2 



TE buffer 



10 mM 



Tris-HCl pH 7. 5 



1 mM 



EDTA 



C) Various 

Dynabeads M-280 Streptavidin* (10 mg/ml) 
Magnetic particle concentrator -Dynal MPC 

A Thermal cycler (PTC-200 Peltier Thermal Cycler MJ 
Research*) 

Ultrapure dNTP set, 2' -Deoxynucleoside 5' -Triphosphate 
(Pharmacia Biotech) 

Taq DNA Polymerase* (various commercial sources) 
50 mM MgCl2 
0.15 M NaOH 

SeaPlaque Agarose* (Flowgen Instruments Ltd) 

Long Ranger 50% acrylamide (Flowgen) 

* = Trade Names 

Results: Identification of alleles by CSA 

Seven allele specific reference sense strands were isolated 
from homozygous cell lines and hybridised to several anti- 
sense strands from other cell lines whose HLA specificity was 
defined by sequencing and in some cases only by serology. 
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Allelic bands could be identified by homoduplex formation 
with a reference strand, which would migrate at the same rate 
as the double stranded DNA reference band (control band) or 
heteroduplex bands, cathodic compared with the control band. 



It was observed that when the reference strand matched 
completely the anti -sense strand a single homoduplex band was 
visible in the gel. In cases when the reference strand 
differed by 3 or more bases from the anti-sense strand, a 
10 cathodic band corresponding to a heteroduplex was seen. This 
pattern was observed reproducibly for all the allele 
specific reference strands and is therefore- independent of 
the position of the mismatch (es) on the strands and the 
specific base sequence of the allelic reference (Figure 3) . 



In other experiments with longer PAGE it was possible to 
separate allelic strands that were different by only a single 
nucleotide. Use of denaturing PAGE enhanced the separation 
of the allelic bands. 



This method of allele separation has been used to analyse 
HLA-A, HLA-B and HLA-Cw alleles from 22 heterozygous and 41 
homozygous cell lines (HLA-A 33; HLA-B 30 and HLA-Cw 18 
alleles) and to identify the individual alleles without 
ambiguity by a method named "URSTO" and described in ref 35 
and in an international (PCT) patent application being filed 
on the same day as this application in the name of the 
Anthony Nolan Bone Marrow Trust. In particular DNA was 
extracted from 63 B-lymphoblastoid cell lines, which included 
22 heterozygous and 41 homozygous lines. After PCR 
amplification with locus specific primers the anti -sense 
single strands were isolated and hybridised with the 
appropriate reference strands (A*0101, BM4 02, Cw*0701) . The 
allelic bands were resolved in non- denaturing PAGE and eluted 
from low melting point agarose. Alleles were identified 
unambiguously from the isolated DNA. The particular alleles 
isolated are set out in the following Table. 



5 



15 



20 
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Table 1 

HLA CLASS I ALLELES ISOLATED BY COMPLEMENTARY STRAND ANALYSIS 



HLA-A (n=33) 

A*0101, A*0102, A*0201, A*0202, A*0203, A*0204, A*0205, 

A*0206, A*0207 / A*0208, A*0209, A*0210, A*0211 / A*0212, 

A*0213, A*0216 / A*0217, A*0301 / A*1101 , A*2301 / A*2402, 

10 A*2403, A*2501, A*2601, A*2902, A*3001, A*3002, A*3101, 
A*3201, A*3301, A*6601 / A*6602, A*6802. 

HLA-B (n=30) 

6*0702, B*0801, 6*1302, B*1402 # 6*1501, 8*1502, B*1520, 

15 6*1801, 6*3501, 6*3701, 6*3801, B*4001, 6*4002, B*4101, 

8*4201, B*4402, B*4403, B*4601, B*4701, B*4801, B*4901, 

6*5001, B*5101, B*5201, B*5301, B*5502, B*5701, B*5801, 
B*5802, B*6701. 



20 HLA-Cw (n=18) 

Cw*0102, Cw*0202, Cw*0302, Cw*0303, Cw*0304, Cw*0401, 
Cw*0501, Cw*0602, Cw*0701, Cw*0702, Cw*0704, Cw*0802, 
Cw*1202, Cw*1203, Cw*1402, Cw*1502, Cw*1601, Cw*1701. 



25 Number of different heterozygous combinations tested: HLA-A 
19, HLA-B 14, and HLA-Cw 11. 



In further experiments, it was shown that heteroduplexes of 
the same alleles from different sources migrate to the same 

30 position on the gel. Figure 3 shows the results from such an 
experiment. Lanes 1, 5 and 8 contain duplexes comprising 
A0101 from different sources and the duplexes all migrated to 
the same position. Lanes 5 and 8 also contain duplexes 
comprising A0201 from different sources and these migrated at 

35 the same speed, but more slowly than the A0101 duplexes. 

This reproducible mobility of allelic bands can be used as a 



10 



15 



20 



4ft 



4 
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method for the identification of alleles diagnostically . 

EXAMPLE 2 : SEP ARATION OF EXON 13 MUTATIONS OF CYSTIC FIBROSIS 
GENE 

CSA can be used for the isolation of any polyallelic gene. 
This example shows the analysis of the cystic fibrosis gene. 
The largest encoded exon of this gene is exon 13. Samples 
from an unaffected individual which is homozygous for the 
wild type exon 13 and samples from two mutations were 
prepared and analysed by CSA. The mutation samples were from 
individuals with a single base deletion 2603delT (36) and a 
single nucleotide substitution at position 709 - R709X (37) . 
These samples had previously been typed by other methods. 



PCR amplification of the exon 13 of the cystic fibrosis gene 
was performed as described in Zielenski et al ,38) except that 
the 5' primer was biotinylated at the 5' end. The reference 
strand was prepared from the wild type sample and after the 
preparation of the single strands from three samples as 
described in example 1, these were hybridised with the 
reference strand and analysed in non- denaturing PAGE. The 
result is shown in Figure 6. The mutations result in 
chimeric duplexes which result in retardation of the mutated 
25 allelic bands in the gel (see lanes 2 and 3) compared to the 
wild type homoduplex (see lane 1) . 



EXAMPLE 3: ALL ELIC IDENTIFICATION BY DOUBLE STRAND 

CONFORMATIO N ANALYSIS (DSCA) 

30 

This Example shows how DNA fragments from a particular 
segment of a genome can be analysed and identified using a 
labelled reference strand without isolation of the duplex 
from the analysis phase. The Example uses specific reference 
3 5 strands for HLA-A, B and Cw alleles. 



The reference strands were prepared using sense primers for 
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10 



15 



the three loci, which had fluorescent dye (Pharmacia Ltd) at 
the 5' -end. These were not biotinylated. Separate PCR 
reactions were performed for each locus to produce the 
reference reagents. The primer sequences and PCR conditions 
were as described above in Example 1 and the amplified 
reference strands included exons 2 and 3, intron 2 and short 
segments from introns 1 and 3 of each locus. Lymphoblastoid, 
homozygous cell lines were used for each locus amplification 
and a number of reference strands were tested in order to 
obtain optimum separation and least ambiguity in the 
identification of the relevant alleles (see Figure 7 and 
Table 2 below) . For analysis of the HLA class I alleles, the 
use of two reference strands for each locus allows optimum 
differentiation between the alleles tested. Figure 8 
illustrates the resolution of the DSCA method. 



TABLE 2 



DOUBLE STRAND CONFORMATION ANALYSIS TESTING OF REFERENCE 
2 0 STRANDS 



25 



30 



35 





Allele 


RV 


No tested 


RV 


No tested 






REF: A*0I0l 




REF: A*02!7 




A-locus 


A*0201 


1169.0 ± 0.71 


9 


1009.4 ± 0.52 


8 




A*0301 


1085.5 ± 0.55 


6 . 


1143.7 ± 1.03 


6 




A*2402 


1308.8 ± 0.45 


5 


1291.7 ± 0.76 


7 




A*3001 


1 168.0 ± 0.71 


5 


1210.0 + 0.0 


2 






REF: B*4201 




REF: B*4402 




B-Iocus 


B*0801 


1044.0 ± 0.0 




1 204.0 ± 0.0 


3 




B*1302 


1373.7 ± 0.58 


4 


1 171.7 ± 0.58 


3 




B*I 501 


1164.8 ± 0.50 


4 


1331.5 ± 1.00 


4 




B*400l 


1 179.0 ± 0.0 


4 


1230.0 ± 1.00 


3 




B*3801 


1264.0 ± 0.0 


3 


1 124.3 ± 0.50 


4 




B*4403 


1296.7 ± 0.58 


3 


1031.3 ± 0.58 


4 




8*5 101 


1392.5 ± 0.58 


4 


1221.6 ± 0.89 


5 






REF: Cw*0303 




REF: Cw*070l 




Cw-Iocus 


Cw*040l 


113 1.5 ± 0.58 


4 


1432.8 ± 0.50 


4 




Cw*050I 


1063.2 ± 0.84 


5 


1 399.7 ± 0.58 


3 




Cw*0602 


1075.8 ± 0.40 


1 1 


1377.8 ± 0.45 


5 




0*0701 


1782.8 ±0.50 


4 


(reference} 
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RV = relative value, is the measure of the migration of the 
bands. It is arrived at by using the positions of the front 
running primer band and the reference homoduplex (values 1 
and 1000 respectively) in each track, to compute a standard 
curve. The values of the migration of the subsequent bands 
is calculated from this curve. RV is followed by SD for each 
set of tests. 

DNA samples were prepared in the same manner from other cell 
lines using non-biotinylated locus specific primers. The 
amplified sample and the reference preparation were mixed in 
a ratio of 2:1 or 3:1 and hybridised (as in item 2.5 of 
Example 1) and applied to 6% non- denaturing PAG (Long Ranger, 
Trade Name) . The elect rophore tic analysis was performed in 
an ALFexpress DNA sequencer™ (Pharmacia Ltd) and the 
mobilities of the labelled duplexes were determined with an 
integral laser activated system with photodiode detectors. 
The results are shown in Figures 9-14 and in the following 
Table. 



20 



TABLE 3 
HLA-A LOCUS 



25 



30 



35 



A*0l0i 

A*0I02 

AM10I 

A*0301 

A*8001 

A*3J0I 

A*260I 

A*6601 

A*2902 

A*6802 

A*330I 

A*02!3 

A*0203 

A*02I7 

A*300I 

A*0208 

A*020I 

A*021I 

A*02I0 

A*0205 

A*320l 

A*2402 

A*2403 

A*2501 



RV 



1020.6 
1047.8 
1087.8 
1092.8 
1099.3 
1100.0 
1109.6 

n 17.6 

1126.9 
1129.0 
1162.1 
1 1 68.9 
1168.5 
II 69.3 
1 1 70.0 
1173.5 
1177.7 
1180.2 
1195.6 
1278.1 
1320.1 
1320.0 
1333.5 



No tested 



KbhhKbNCE A-0217 



± 
± 
± 
± 
± 
± 
± 
± 
± 
± 
± 
± 
± 
± 
± 
± 

± 
± 
± 

± 



0.7 
1.1 
1.9 
2.0 
2.2 
2.4 
2.8 
2.1 
2.8 
2.7 
3.3 
3.3 
3 
3 
3 
5 

4.7 
3.4 
4.9 
7.6 
8.5 
6.5 
7.3 



.1 



± 5.1 



(reference) 
9 
9 

23 
8 

13 
10 
17 

8 
9 

18 
10 

8 
8 

30 
9 

13 
10 
9 

10 

20 
22 
4 
8 



J 144.0 
1 154.8 
1148.3 
1142.0 
1152.6 
1143.9 
1121.2 
1131.6 
1 157.1 



± 
± 
± 

± 
± 
± 
± 
± 
± 

1073.0 ± 
1 133.7 ± 
1052.0 ± 
I0I6.0 ± 
1000 

1208.7 ± 
10313 ± 

1008.8 ± 
1031^ ± 
1013.0 ± 

1027.8 ± 
1288.4 ± 
1284.2 ± 

1268.9 ± 
1333.0 ± 



6A 
2.9 
3.0 
5.2 
2.9 
3.5 
2.3 
2.5 
3.3 
1.0 
5.0 
1.7 
1.2 

6.0 
0.5 
1.4 
3.1 
0.8 
0.8 
6.8 
6.3 
5.2 
3.8 



Ntr tested 



24 
4 

8 

36 
5 

14 
5 
I 1 

8 
5 

12 

4 

4 

(reference) 
!9 
4 

22 
4 
4 
6 

20 
39 

9 

4 




•ft 
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HLA-Cw LOCUS 



Al I I'J K 



B*080l 
B*I801 
B*0705 
B*0702 
B*370I 
BM401 
B*350I 
BM801 
B*I508 
B*5502 
B*1520 
B*I50! 
BM002 
B*460i 
BM502 
B*I5!9 
B*550I 
B*400l 
B*5601 
B*2704 
B*540I 
B*780! 
BM513 
B*3801 
B*3901 
B*4402 
B*44031 
B*530I 
B*5104 
B*3802 
B*590I 
B*I302 
B*5I0I 
B*520! 
BM5I7 
B*580! 
B*5701 
B*5702 
B*450I 
B*500l 
BM90I 
BM5I6 



"REFERENCE" 
RV 



B*4201 
No tested 



T5EFERERUE" 
"RV 



1043.2 
1092.0 
1096.3 
1100.7 
1 1 15.0 
1120.3 
1122.1 
1136.3 
1138.8 
1 142.4 
1149.7 
1162.8 
U7I.7 
1172.3 
1174.0 
1 174.3 
1 176.3 
1176.8 
1180.0 
N84.8 
1 190.5 
1211.3 
1213.0 
1265.7 
1267.2 
1271.3 
1293.9 
1307.5 
1321.5 
1346.0 
1374.5 
1375.0 
1392.7 
1408.8 
1431.3 
1436.9 
1459.1 
1473.3 
1566.3 
1588.5 
1764.5 
1836.3 



: 0.9 
: 1.5 
: 1.0 
0.8 
3.2 
1.7 
2.2 
3.1 
2.2 
2.9 
3.1 
2.8 
4.9 
3.0 
4.1 
3.6 
1.0 
3.1 
1.8 
3.3 
3.0 
3.3 
1.8 
2.5 
2.7 
3.5 
3.8 
2.1 
4.0 
3.5 
1.3 
6.0 
4.7 
5.3 
5.9 
5.3 
5.8 
4.6 
5.9 
5.6 
12.8 
5.9 



(reference) 
21 
5 
4 
6 
4 
4 

28 
4 
4 
5 
9 

19 

4 

4 

5 
4 
4 

25 
4 
4 
4 
4 
4 

18 

5 

10 
18 
6 
4 
4 
4 
18 
20 
6 
4 
1 1 
10 
4 
4 
13 
6 
4 



TO 

1206.1 
1302.0 
1341.0 
1353.9 
1170.6 ± 2.0 
1272.2 ± 1.6 
1213.6 
1227.8 



3.0 
4.9 
3.9 
3.6 



2.6 
3.2 



1 344.4 ± 4.9 



3.8 
3.0 



1387.3 
1195.8 
I331r?~± 4.5 
1209.8 ± 3.7 

1495.0 ± 3.0 

1332.1 ± 2.7 
1347.0 ± 3.6 
1390.5 ± 0.7 

1229.0 ± 2.3 
1375.5 ± 3.5 
1162.8 ± 2.2 

1430.5 ± 7.8 

1 362.3 ± 5.0 

1215.4 ± 3.0 

1123.1 ± 3.0 
M 24.5 ± 3.5 

1000 

1030.7 ± 1.7 

1093.6 ± 2.4 

1 136.8 ± 2.2 

1114.9 ± 2.0 
1194.8 ± 2.6 
1 169.6 ± 3.0 

1222.8 ± 2.4 

1179.9 ± 2.9 

1340.5 ± 4.8 
1241.0 ± 15.9 

1347.4 ± 13.9 
1334.8 ± 3.2 

1696.5 ± 14.8 

1705.2 ± 4.2 
1598.5 ±12.1 
1727.8 ± 6.4 



No tested 



"RT 

15 

7 
4 
7 
7 
5 

28 
5 
5 
6 

1 I 

2 I 
5 
4 
7 
4 
4 

22 
4 
5 
4 
4 
7 

20 
4 

(reference) 
14 
7 
4 
7 
4 

1 8 
24 
7 
4 
9 

12 

4 

4 

6 

4 

5 



ALLELES 


RV 




No tested 


RV 




No tested 


Cw*0701 


1000 




(reference) 


ntu ± 


0.5 


4 


Cw*0702 


1019.4 ± 


1.4 


8 


1824.6 ± 


8.8 


5 


Cw*0704 


1042.3 ± 


0.6 


4 


1807.3 ± 


3.6 


4 


CW0302 


1341.5 ± 


6.4 


4 


1029.3 ± 


1.0 


4 


Cw*I601 


1350.6 ± 


2.3 


7 


1048.9 ± 


1.8 


7 


Cw*03O3 


1351.9 ± 


5.0 


7 


1000 


(reference) 


Cw*0I02 


1376.2 ± 


6.2 


13 


1074.3 ± 


1.5 


6 


CW0602 


1376.4 ± 


7.0 


28 


1075.6 ± 


1.1 


18 


O*080I 


1377.5 ± 


1.3 


4 


1059.3 ± 


1.3 


7 


CWI202 


1380.0 ± 


5.7 


4 


1064.3 ± 


1.1 


7 


CWI203 


1387.5 ± 


7.6 


10 


1065.9 ± 


1.5 


7 


Cw*l402 


1392.5 ± 


2.6 


4 


1067.3 ± 


1.1 


7 


Cw*I502 


1394.3 ± 


8.3 


4 


1060.4 ± 


1.5 


7 


Cw*050! 


1395.8 ± 


5.0 


13 


1063.2 ± 


2.5 


I 3 


Cw*040I 


1428.8 ± 


6.4 


4 


M31.4 ± 


2.0 


I 3 


CWI70I 


1436.0 ± 


9.4 


5 


1076.5 ± 


3.0 


4 



0 
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The analysis of the combined data from two reference strands 
is demonstrated in Figure 11 for HLA-A and in Figure 14 for 
HLA-B locus. In this graphic analysis, the RV for one 
reference strand is plotted against the RV of a second 
reference. This results in a unique parameter for each 
allele that is diagnostic for each allele tested. 

EXAMPLE 4 : DSCA ANALYSIS OF INTRON 2 SEQUENCES FROM HLA CLASS 
I ALLELES 

It is generally agreed that the extensive polymorphism of the 
HLA alleles is confined mainly to the antigen recognition 
sites. These are encoded for by exons 2 and 3 of the HLA 
class I genes. This segment of the gene which has been used 
in Example 3 for allelic definition by DSCA, includes intron 
2. Small variations in intronic sequences of HLA class I 
genes have been reported (39) and these will influence the 
mobilities of the bands where the sequences are different 
from the reference strand sequence. More important for DSCA 
analysis is the possibility of such variations in an allelic 
sequence that may have arisen in different haplotypes, 
independently of the exon recombinants. The DSCA is 
sufficiently sensitive to test for intra-allelic sequence 
variations . 



Eight common and frequent caucasoid alleles from HLA-A and 
HLA-B loci were investigated. Locus specific amplifications 
were performed as in Example 3. The amplified DNA fragments 
were analysed under both denaturing and non- denaturing 
conditions using several labelled reference strands from both 
loci . 



The numbers of alleles analysed were: 



HLA-A A2 284, Al 148, A3 82, All 52, 

HLA-B B7 114, B8 110, B44 125. 



A32 29. 



• 
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In all cases the allelic bands migrated with identical 
mobility indicating that there was no intra-allelic 
polymorphism in the samples tested (Figure 15) . It would 
therefore appear that the intron sequences are conserved for 
5 each allele. 
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CLAIMS 

1. A method for separating a DNA molecule from a mixture 
of DNA molecules, which method comprises: 

(i) amplifying the DNA molecules in the mixture; 

(ii) hybridising single strands of the amplified DNA 
molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; 
and 

(iii) separating the duplexes. 

2 . A method according to claim 1 which comprises 

(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce an 
amplified mixture of double -stranded DNA 
molecules in which one of the strands carries a 
ligand; 

(ii) contacting the amplified mixture of double- 
stranded DNA molecules with a receptor on a 
solid support under conditions such that the 
ligand binds to the receptor; 

(iii) separating the mixture of double-stranded DNA 
molecules into single-strands and removing the 
strands that are not bound to the support by the 
ligand; 

(iv) recovering the remaining strands from the 
support ; 



(v) 



mixing the recovered strands with 
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complementary strand of a reference DNA molecule 
so as to form duplexes; and 

(vi) separating the duplexes. 

3 . A method according to claim 2 wherein the 
complementary strand of the reference DNA molecule is 
provided by 



(i) amplifying the reference DNA molecule employing 
a pair of primers in which one of the primers 
carries a ligand, so as to produce amplified 
double -stranded reference DNA molecule in which 
one of the strands carries a ligand; 

(ii) contacting the double- stranded reference DNA 
molecule with a receptor on a solid support 
under conditions such that the ligand binds to 
the receptor ; 

(iii) separating the double -stranded reference DNA 
molecule into single -strands and removing the 
strand that is not bound to the support by the 
ligand; and 

(iv) recovering the remaining strand from the 
support . 

4. A method according to claim 2, modified by recovering 
the strands that are not bound to the support instead of the 
strands that are bound to the support , and mixing these 
recovered strands with a complementary strand of a reference 
DNA molecule in step (v) . 



5. A method- according to claim 3, modified by recovering 
the strand of the reference DNA molecule that is not bound to 
the support instead of the strand that is bound to the 
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support, and using this recovered strand in step (v) of claim 
2. 



6 • A method according to claim 1 which comprises 

(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a high molecular weight 
molecule, so as to produce an amplified mixture 
of double -stranded DNA molecules in which one of 
the strands carries a high molecular weight 
molecule; 

(ii) separating the mixture of double -stranded DNA 
molecules into single strands; 

(iii) mixing the single strands with a complementary 
strand of a reference DNA molecule so as to form 
duplexes; and 

(iv) separating the duplexes. 

7. A method according to claim 6 wherein the 

complementary strand of the reference DNA molecule is 
provided by 

(i) amplifying the reference DNA molecule employing 

a pair of primers in which one of the primers 
carries a high molecular weight molecule, so as 
to produce an amplified double -stranded 
reference DNA molecule in which one of the 
strands carries a high molecular weight 
molecule; and 



(ii) 



separating the double -stranded reference DNA 
molecule into single strands. 
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8 . 



A method according to claim 1 which comprises 



(i) 



amplifying a single strand of each of the DNA 
molecules in the mixture; 



(ii) 



mixing the amplified single strands with a 
complementary strand of a reference DNA molecule 
so as to form duplexes; and 



(iii) separating the duplexes. 

9. A method according to claim 8 wherein the 
complementary strand of the reference DNA molecule is 
provided by amplifying a single strand of the reference DNA 
molecule . 

10. A method according to any one of the preceding claims 
wherein the reference DNA molecule has a known sequence. 

11. A method according to any one of the preceding claims 
wherein the duplexes are separated by gel electrophoresis. 

12 . A method according to claim 11 wherein the 
electrophoresis is performed under denaturing conditions. 

13 . A method for identifying a DNA molecule in a mixture 
of DNA molecules, which method comprises separating the DNA 
molecules by a method as defined in claim 11 or 12, and 
comparing the positions of the separated duplexes on the gel 
with the position of a control DNA molecule. 

14. A method for identifying a DNA molecule in a mixture 
of DNA molecules, which method comprises separating the DNA 
molecules by a method as defined in any one of claims 1 to 12 
and sequencing each of the separated DNA molecules, carrying 
out sequence specific primer (SSP) amplification analysis or 
carrying out sequence specific oligonucleotide (SSO) 
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analysis . 

15 . A method according to any one of the preceding claims 
wherein the mixture of DNA molecules is a mixture of alleles 
of a polyallelic gene. 

16. A method according to claim 15 wherein a reference 
allele is used which has the same serotype as at least one of 
the alleles in the mixture of alleles. 

17. A method according to claim 15 or 16 wherein the 
mixture of alleles is from a prospective recipient or a 
prospective donor in a tissue or organ transplant operation. 

18. A method for determining whether a prospective 
recipient in a tissue or organ transplant operation has 
alleles of a gene that are compatible with the alleles of a 
prospective donor in the operation, which method comprises 

(i) amplifying the alleles of the prospective 



recipient employing a pair of primers in which 
one of the primers carries a ligand, so as to 
produce amplified double -stranded alleles of the 
prospective recipient in which one of the 
strands carries a ligand; 



(iii) separating the double -stranded alleles into 
single-strands and removing the strands that are 
not bound to the support by the ligand ; 



(ii) 



contacting the amplified double -stranded alleles 
with a receptor on a solid support under 
conditions such that the ligand binds to the 
receptor; 



(iv) 



recovering the remaining strands from the 
support ; 
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(v) mixing the recovered strands with complementary 

strands of the alleles of the prospective donor 
so as to form test duplexes; 

5 (vi) separating the test duplexes by gel 

electrophoresis; and carrying out one or more of 
the following steps : 

(vii) comparing the positions to which the test 
10 duplexe migrate on the gel with the position of 

a control DNA molecule; 

(viii) sequencing the test duplexes; 

15 (ix) sequence specific primer (SSP) amplification 

analysis; and 

(x) sequence specific oligonucleotide (SSO) 

analysis . 

20 

19- A method according to claim 18 wherein the 

complementary strands of the alleles of the 
prospective donor are provided by 

25 (i) amplifying the alleles of the prospective donor 

employing a pair of primers in which one of the 
primers carries a ligand, so as to produce 
amplified double -stranded alleles of the 
prospective donor in which one of the strands 

3 0 carries a ligand; 

(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under 
conditions such that the ligand binds to the 
3 5 receptor; 



(iii) 



separating the double -stranded alleles into 
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single- strands and removing the strands that are 
not bound to the support by the ligand; and 

(iv) recovering the remaining strands from the 
support . 

20. A method according to claim 18 or 19 wherein the 
prospective donor is selected to have alleles of the same 
serotype as the prospective recipient. 

21. A method according to any one of claims 18 to 20 
wherein the control DNA is a homoduplex between two strands 
of the same allele and migration of the test duplexes to the 
same position on the gel as the homoduplex indicates that the 
prospective recipient and the prospective donor have the same 
alleles . 

22. A method according to any one of claims 15 to 21 
wherein the alleles are of a human leucocyte antigen (HLA) 
class I gene or an HLA class II gene. 

23. A method according to claim 22 wherein the alleles 
are of the HLA-A gene, the HLA-B gene or the HLA-C gene. 

24 . A method according to any one of claims 2 to 4 and 11 
to 23 wherein the ligand is biotin and the receptor is 
streptavidin. 

25. A method according to any one of claims 2 to 4 and 11 
to 24 wherein the solid support is magnetic beads, and the 
strands which do not carry a ligand are removed by attracting 
the beads to a magnet and washing the beads under conditions 
such that the double -stranded DNA molecules dissociate into 
single strands . 

26. A method according to any one of the preceding claims 
wherein amplification of the DNA molecules is performed by 
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polymerase chain reaction 

27. A method according to any one of claims 17 to 26 
wherein the prospective recipient and the prospective donor 
are a prospective bone marrow recipient and a prospective 
bone marrow donor, or a prospective kidney recipient and a 
prospective kidney donor. 

28. A method according to any one of claims 1 to 27 
wherein the DNA molecules in the mixture of DNA molecules 
have the same number of nucleotides but different base 
sequences . 

29. A method for identifying a DNA molecule, which method 
comprises : 

(i) contacting the DNA molecule with a labelled 
reference DNA strand under conditions such that 
the reference strand hybridizes to a 
complementary strand of the DNA molecule so as 
to form a test duplex; 

(ii) running the test duplex and one or more control 
duplex (es) in a gel by electrophresis ; and 

(iii) comparing the position of the test duplex on the 
gel with the position (s) of the control 
duplex (es) . 

30. A method according to claim 29 wherein the control 
duplexes are either (a) duplexes which have faster and/or 
slower mobility than the test duplex and which are run in the 
same lane on the gel as the test duplex or (b) duplexes which 
have graded mobilities and which are run in a different lane 
on the gel to the test duplex. 
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(PCR) 



31. A method according to claim 29 or 30 wherein the 
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reference strand is labelled with a fluorescent label or a 
label suitable for attachment of an enzyme. 

32. A method according to any one of claims 29 to 31 
wherein steps (i) to (iii) are repeated for a second or more 
times and a different reference strand is used in each 
repeat . 

33. A method according to any one of claims 29 to 31 
wherein steps (i) to (iii) are repeated a second time using 
a second reference strand. 

34. A method according to claim 32 or 33 wherein the DNA 
molecule to be identified is an allele of a gene having from 
10 to 300 alleles. 

35. A method according to claim 34 wherein the gene has 
from 30 to 100 alleles. 

36. A method according to any one of claims 2 9 to 35 
wherein the gene is an HLA gene. 

37. A method according to claim 36 wherein the gene is an 
HLA class I gene . 

38. A method according to claim 37 wherein the gene is 
HLA- A, HLA-B or HLA-C. 
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